Index Coding and Error Correction 



Son Hoang Dau, Vitaly Skachek, and Yeow Meng Chee 
Division of Mathematical Sciences, School of Physical and Mathematical Sciences 
Nanyang Technological University, 21 Nanyang Link, Singapore 637371 
Emails: { DauS0002, Vitaly . Skachek, YMChee } gntu.edu.sg 



o 



> 

00 
(N 

(N 



X 



Abstract — A problem of index coding with side information was 
first considered by Y. Birk and T. Kol (IEEE INFOCOM, 1998). In 
tlie present work, a generalization of index coding scheme, where 
transmitted symbols are subject to errors, is studied. Error- 
correcting methods for such a scheme, and their parameters, are 
investigated. In particular, the following question is discussed: 
given the side information hypergraph of index coding scheme 
and the maximal number of erroneous symbols S, what is the 
shortest length of a linear index code, such that every receiver 
is able to recover the required information? This question turns 
out to be a generalization of the problem of finding a shortest- 
length error-correcting code with a prescribed error-correcting 
capability in the classical coding theory. 

The Singleton bound and two other bounds, referred to as 
the a-bound and the K-bound, for the optimal length of a 
linear error-correcting index code (ECIC) are established. For 
large alphabets, a construction based on concatenation of an 
optimal index code with an MDS classical code, is shown to 
attain the Singleton bound. For smaller alphabets, however, this 
construction may not be optimal. A random construction is also 
analyzed. It yields another inexplicit bound on the length of an 
optimal linear ECIC. Finally, the decoding of linear ECIC's is 
discussed. The syndrome decoding is shown to output the exact 
message if the weight of the error vector is less or equal to the 
error-correcting capability of the corresponding ECIC. 



I. Introduction 



A. Background 



The problem of Index Coding with Side Information (ICSI) 
was introduced by Birk and Kol fl ). During the transmission, 
each client might miss a certain part of the data, due to 
intermittent reception, limited storage capacity or any other 
reasons. Via a slow backward channel, the clients let the server 
know which messages they already have in their possession, 
and which messages they are interested to receive. The server 
has to find a way to deliver to each client all the messages he 
requested, yet spending a minimum number of transmissions. 
As it was shown in 1 1 1, the server can significantly reduce the 
number of transmissions by coding the messages. 

Possible applications of index coding include communica- 
tions scenarios, in which a satellite or a server broadcasts a set 
of messages to a set cUents, such as daily newspaper delivery 
or video-on-demand. Index coding with side information can 
also be used in opportunistic wireless networks Q. 

The ICSI problem has been a subject of several recent 
studies f3l-f8l. This problem can be viewed as a special case 
of the Network Coding (NC) problem [9), HO). In particular, 
as it was shown in 171, every instance of the NC problem can 
be reduced to an instance of the ICSI problem. 



B. Our contribution 

In this work, we generalize the ICSI problem towards a 
setup with error correction. We extend some known results 
on index coding to a case where any receiver can correct up 
to a certain number of errors. The problem of designing such 
error-correcting index codes (ECIC's) naturally generalizes the 
problem of constructing classical error-correcting codes. We 
establish an upper bound (the K-bound) and a lower bound 
(the a-bound) on the shortest length of a linear ECIC, which 
is able to correct any error pattern of size up to S. We also 
derive an analog of the Singleton bound, and show that this 
bound is tight for codes over large alphabets. We also consider 
random ECIC's. By analyzing their parameters, we obtain an 
upper bound on their length. Finally, we discuss the decoding 
of linear ECIC's. We show that the syndrome decoding results 
in a correct result, provided that the number of errors does not 
exceed the error-correcting capability of the code. 

The problem of error correction for NC was studied in 
several previous works. However, these results are not di- 
rectly applicable for the ICSI problem. First, the existing 
works only consider the multicast scenario, while the ICSI 
problem, however, is a special case of the non-multicast NC 
problem. Second, the ICSI problem can be modeled by the 
NC scenario lH, yet, this requires that there are directed edges 
from particular sources to each sink, which provide the side 
information. The symbols transmitted on these special edges, 
unlike for error-correcting NC, are not allowed to be corrupted. 

II. Preliminaries 

Let ¥q be the finite field of q elements, where g is a power 
of prime, and F* = ¥g\{0}. Let [n] = {1,2,..., n}. For the 
vectors u,v ^ F^, we use d{u, v) to denote the the Hamming 
distance between u and v. If m G F^' and M C F^' is a set 
of vectors (or a vector subspace), then this notation can be 
extended to 

d(u, M) = min d(u, v) . 
veM 

Given q, k, and d, let Nq[k, d] denote the length of the shortest 
linear code over F^ which has dimension k and minimum 
distance d. The support of a vector m G F^ is defined by 
supp(tt) = {i E [n] : Ui ^ 0}. The Hamming weight of u is 
defined by \nX{u) = |supp(it)|. Suppose E C [n]. We write 
u <\ E whenever supp(M) C E. 

We use e, = (0, . . . , 0, 1, 0, . . . , 0) G F'^' to denote the 

n—i 

unit vector, which has a one at the zth position, and zeros 



2 



elsewhere. For a vector y = (j/i, 7/2, • ■ • , J/n) and a subset 
B = {ii, 12, • ■ • , ib} of [n], where ii < 12 < • • • < ib, let 
denote the vector {yi^ , i/ia , • . • , j/i J. 

For an n X matrix L, let denote its ith row. For a 
set E Q [n], let denote the \E\x N matrix obtained from 
L by deleting all the rows of L which are not indexed by 
the elements of E. For a set of vectors M, we use notation 
spsn{M) to denote the linear space spanned by the vectors 
in M. We also use notation colspan(i) for the linear space 
spanned by the columns of the matrix L. 

Let Q ~ (V, E) be a graph with a vertex set V and an edge 
set £. A directed graph Q is called symmetric if 

(m, u) e 5 ^ (u, m) e f . 

The independence number of an undirected graph 5 is de- 
noted by a{Q). There is a natural correspondence between 
undirected graphs and directed symmetric graphs. By using 
this correspondence, the definition of independence number is 
naturally extended to directed symmetric graphs. 

in. Error-Correcting Index Coding with Side 
Information 

Index Coding with Side Information problem considers 
the following communications scenario. There is a unique 
sender (or source) S, who has a vector of messages x = 
{xi,X2, ■ ■ ■ , Xn) in his possession. There are also m receivers 
Ri, R2, ■ ■ ■ , Rm, receiving information from S via a broadcast 
channel. For each i G [m], Ri has side information, i.e. i?j 
owns a subset of messages {xj}j^Xi, where Xi C [n]. Each 
Ri, i € [to], is interested in receiving the message Xf(i) (we 
say that Ri requires x^fi-f), where the mapping / : [to] — > [n] 
satisfies f{i) ^ Xi for all i G [to]. Hereafter, we use the 
notation X = {Xi, X2, . . . , Xm)- An instance of the ICSI 
problem is given by a quadruple {171,11, X , f). An instance 
of the ICSI problem can also be conveniently described by 
the following directed hypergraph 

Definition 3.1: Let (to, n, X, f) be an instance of the ICSI 
problem. The corresponding side information (directed) hyper- 
graph H = 7^ (to, n, X, /) is defined by the vertex set V — [n] 
and the edge set £-u, where 

-{(/(«), ^0 : ^e [n]}. 

We often refer to (to, n, X, f) as an instance of the ICSI 
problem described by the hypergraph 

Each side information hypergraph H = {V,£'h) can be 
associated with the directed graph Q-h — (V, £) in the 
following way. For each directed edge {f{i),Xi) G £-h there 
will be \Xi\ directed edges {f{i),v) G £, for v G Xi. When 
m ^ n and f{i) = i for all i G [to], the graph Q-u is, in fact, 
the side information graph, defined in |3|. 

Due to noise, the symbols received by Ri, i G [to], may be 
subject to errors. Assume that S broadcasts a vector y G F^. 
Let ei G F^ be the error affecting the information received 



by Ri, i G [to]. Then Ri actually receives the vector y^ = 
y + €i G F^, instead of y. 

Definition 3.2: Consider an instance of the ICSI problem 
described by H = 'H{m,n, X, f). A S-error-correcting index 
code ((5, ?^)-ECIC) over Fg for this instance is an encoding 
function 

£ : F';^F^, 

such that for each receiver Ri, i ^ [to], there exists a decoding 
function 

: F^ xFI-^'I ^Fg , 

satisfying 

Va;,e, gF^\ wt(e,;) < (5 : T)^{€{x) + e^,xx,) = xj^i) . 

If 6 — 0, we refer to such € as a non-error-correcting index 
code, or just T-L-IC. The parameter N is called the length of 
the index code. In the scheme corresponding to the code £, S 
broadcasts a vector €{x) of length N over ¥q. 

Definition 3.3: A linear index code is an index code, for 
which the encoding function € is a linear transformation over 
Fg. Such a code can be described as 

Va; G F^ : €(cc) = xL , 

where i is an n x matrix over Fg. The matrix L is called the 
matrix corresponding to the index code £, while € is referred 
to as the linear index code based on L. 

Definition 3.4: An optimal linear {6,'H)-ECIC over Fg is 
a linear {d,'H)-EClC over Fg of the smallest possible length 

Hereafter, we assume that X = (^i),;e[m] is known to S. 
We also assume that the code <B is known to each receiver Ri, 

i G [to]. 

Definition 3.5: Suppose TL = 'H(m,n,X,f) corresponds 
to an instance of the ICSI problem. Then the min-rank of % 
over Fg is defined as 

Kg('H) = min{rankF,({fi + e/(j)}ig[„]) : 

V, G F^ , V, < Xi] . 

Observe that Kg(H) generalizes the min-rank over Fg of the 
side information graph, which was defined in [ 3 1 . More specif- 
ically, when TO = n and f{i) = i for all i G [to], Qu becomes 
the side information graph, and KqiT-L) = min-rankg(C?^). The 
min-rank was shown in |3|, [4| to be the smallest number of 
transmissions in a linear index code. 

Lemma 3.1: ( jS), ifTTI ') Consider an instance of the ICSI 
problem described by H = T-L{ra, n, X, /) . 

1) The matrix L corresponds to a linear 'H-IC over Fg if 
and only if for each i G [to] there exists Vi G F^' such 
that Vi <l Xi and Vi + ef[i) G colspan(L). 

2) The smallest possible length of a linear T-L-IC over Fg is 
K?(H). 
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IV. Basic Properties 
We define the set of vectors 

I{q, n)^{zeF^ : 3ie [m] s.t. zx, = 0, ^ O} . 

For all i G [m], we also define yi ^ [n]\(^{f{i)}U Xi^ . Then 
the collection of supports of all vectors in 2{q, T-L) is given by 



Then, the matrix L corresponds to a {S,'H)-ECIC over if 
and only if 



Jm= U : Y^^y,}. 



(1) 



i € [m] 



Lemma 4.1: The matrix L corresponds to a ((5, H)-ECIC 
over Fg if and only if 



wt {zL) > 2(5 + 1 for all z £ I{q, U) 



(2) 



Equivalently, L corresponds to a ((5, H)-ECIC over F^ if and 
only if 




> 25 + 1, 



(3) 



for all K e J{n) and for all choices of z, G F*, i e K. 
Proof: For each a; G F^\ we define 

B{x, S) = {ye¥^ : y = xL + e,ee¥^, wt(e) < S} , 

the set of all vectors resulting from at most 6 errors in the 
transmitted vector associated with the information vector x. 
Then the receiver Ri can recover a;j(j) correctly if and only 
if 

B{x,6)nB{x',S) ^ 0, 
for every pair x,x' G F^ satisfying: 

^x, = x'x^ and Xf(^i) ^ a;'y|-j) . 

(Observe that Ri is interested only in the bit a;/(i), not in the 
whole vector x.) 

Therefore, L corresponds to a {6, H)-ECIC if and only if 
the following condition is satisfied: for all i G [m] and for all 
x,x' G F^' such that xxi = x'^, and xyj-j) ^ ^/(i)' it holds 

Ve,e' G F^, wt(e) S, wt(e') 6 : 

xL + e^x'L + e' . (4) 

Denote z = x' — x. Then, the condition in ^ can be 
reformulated as follows: for all i G [n] and for all 2; G F^ 
such that zxi = and zj(i) 7^ 0, it holds 

Ve, e' G F^, wt(e) S, wt(e') S : zL e - e' . (5) 

The equivalent condition is that for all z G I{q, H), 

wt{zL) ^ 2(5 + 1 . 

Inequality (O follows from this condition in a straight-forward 
manner ■ 

Corollary 4.1: For all i G [m], let 

M, =span({ij : j G J^J) . 



Mi G H : <^{Lf(e),M^) > 2(5+1 



(6) 



Example 4.1: Let g = 2, m = n = 3, and /(i) = i for 
i G [3]. Suppose Xi = {2,3}, A^s = {1,3}, and ^"3 = {1,2}. 
Let 

/I 1 1 o^ 
i = 1 1 1 
Vi 1 1, 

Note that L generates a [4,3, 1]2 code, which has minimum 
distance one. However, the index code based on L can still 
correct one error. Indeed, let % — 7i{3, 3, X, /), we have 

i{2,n) = {100,010,001}. 

Since each row of L has weight at least three, it follows 
that wt{zL) > 3 for all z G 1(2, -H). By Lemma 14+1 L 
corresponds to a (1, ?^)-ECIC over F2. 

Example 4.2: Assume that m — n and f{i) — i for all 
i G [m]. Furthermore, suppose that Xi — for all i G [m] 
(i.e. there is no side information available to the receivers). 
Let H = Him,, n, X, /). Then, I{q, K) = F;'\{0}. Hence, by 
Lemma 14.11 the n x N matrix L corresponding to a {5, %)- 
ECIC over Fg (for some integer 5 ^ 0) is a generating matrix 
of an [N,n,'^ 25 + l]g linear code. Thus, the problem of 
designing an ECIC is reduced to the problem of constructing 
a classical linear error-coiTecting code. 

V. The a-BouND and the k-Bound 

Let (m, n, X, f) be an instance of the ICSI problem, and 
let "H be the corresponding side information hypergraph. Next, 
we introduce the following definitions for the hypergraph TL. 

Definition 5.1: A subset H of [71] is called a generalized 
independent set in % if every nonempty subset K of H 
belongs to 

Definition 5.2: A generalized independent set of the largest 
size in % is called a maximum generalized independent set. 
The size of a maximum generalized independent set in % is 
called the generalized independence number, and denoted by 

When m = n and /(i) = i for all i G [n], the generalized 
independence number of T-L is equal to the maximum size of 
an acyclic induced subgraph of Qu, which was introduced 
in Is). In particular, when Qu is symmetric, a{T-L) is the 
independence number of Qu- We omit the proof. 

Theorem 5.1 (a-bound): The length of an optimal linear 
((5, 7^)-ECIC over ¥g satisfies 

Mg{H,6) ^ Ng[a{H),26 + l] . 
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Proof: Consider an n x matrix L, which corresponds 



to a ((5, 7^)-ECIC. Let H ^ {ii,i2, 



} be a maximum 



generalized independent set in H. Then, every subset K C H 
satisfies K e J{T-L). Therefore, 



wt 



z,L, > 2(5 + 1 



and for all choices of £ F*, i £ iiT. 



for &\\K QH,K ^ 

Hence, the a{T-L) rows of L, namely Li-^, Li^, . . . , Li^^^^ 
form a generator matrix of an [N, 0(71), 26 + l]q code. 
Therefore, 

N >Ng[a{n),2S + l] . 



The following proposition is based on the fact that concate- 
nation of a (5-error-correcting code with an optimal (non-error- 
correcting) n-lC yields a {S,'H)-ECIC. 

Proposition 5.2 (K-bound): The length of an optimal 
{6, H)-ECIC over satisfies 

N'qin^S) <N,[Kq{n),25+l] . 

The proof of this proposition is omitted due to lack of space. 

Corollary 5.1: The length of an optimal linear ((5, 7^)-ECIC 
over Fq satisfies 

Nq[a{n),25+\\ <Mq{n,5) <Nq[nq{'H),25+l] . 

Example 5.1: Let q = 2, m ^ n = 5, 6 = 2, and /(i) = i 
for all i £ [m]. Assume 

A'i = {2,5}, A'2 = {1,3}, A'3 = {2,4}, 
A'4 = {3,5}, A'5 = {1,4}. 

Let H ~ H{5,5, X , f ). The side information graph Q-u of 
this instance is a pentagon. It is easy to verify that a{H) = 
a{g) = 2. It follows from Theorem 9 in 14J that K2(H) = 
min-rarLk2(5^i) = 3. Thus, from lfT2l we have 

N2[2,5] = 8 and 7V2[3,5] = 10. 

Due to Corollarv l5.ll we have 

8 < A/'2CH,2) < 10. 

Using a computer search, we obtain that Af2{'H, 2) = 9, and 
the corresponding optimal scheme is based on 



/I 

1 


V 






1 

V 



It is technical to verify that by Lemma 14.11 L corresponds to 
(2, 7^)-ECIC. The length of this ECIC Hes strictly between the 
a-bound and the K-bound. 

Remark 5.1: Example 15.11 illustrates that over small al- 
phabets, the concatenation of an optimal linear (non-error- 
correcting) index code and an optimal linear error-correcting 
code may fail to produce an optimal linear ECIC. 



VI. The Singleton Bound 

Theorem 6.1 (Singleton bound): The length of an optimal 
linear {S, 'H)-ECIC over ¥q satisfies 

Mq{n,5) > Kg{H)+2S . 

Proof: Let L be the n x AfqCH, 6) matrix corresponding 
to some optimal {S,H)-ECIC. Let L' be the matrix obtained 
by deleting any 25 columns from L. 

By Lemma 1431 L satisfies for all z £ X{q,H), 

wt{zL) ^ 2(5 + 1 . 

We deduce that the rows of L' also satisfy that for all z £ 

\Nt{zL') ^ 1 . 

By Lemma [4+1 L' corresponds to a linear H-IC. Therefore, 
by Lemma [TT] part 2, L' has at least Kq{H) columns. We 
deduce that 

A/", (H, (5) -2(5 > KqiH) , 
which concludes the proof. ■ 

The corollary below shows that for sufficiently large al- 
phabets, a concatenation of a classical MDS error-correcting 
code with an optimal (non-error-correcting) index code yields 
an optimal linear ECIC. 

Corollary 6.1 (MDS error-correcting index code): For q > 

Kq{H)+26-l, 

Mqin,d)^Kq{n) + 2S . (7) 

Proof: Follows from Theorem |6T| and Proposition 15. 21 ■ 

Remark 6.1: There exist hypergraph T-L, such that Q-u is the 
(symmetric) odd cycle of length n, for which the a-bound is 
at least as good as the Singleton bound. 

VII. Random codes 

Theorem 7.1: Let TL = H{ni, n, X, f) describe an instance 
of the ICSI problem. Then there exists a {6, 'H)-ECIC over F^ 
of length N if 

V (z"-l^'l-i < — , (8) 



iG[m] 



Vq{N,2S) ' 



where 



25 



VqiN,2S)^Yl 



i=0 



N 



is the volume of the q-aiy sphere in F^. 
Idea of proof: We construct a random n x N matrix L over 
¥q, row by row. Each row is selected independently of other 
rows, uniformly over F^. The result is obtained by bounding 
from above the probability of the event 

U E,, where = {d(X/(,), M,) < 2(5 + 1} , 

i G [rn] 

and by making this probability less than 1. 
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Remark 7.1: The bound in Theorem 17.11 implies a bound 
on Kq{TL), which is tight for some TL. Indeed, fix 6 — 0. Take 

m ^ n = 2e + 1 {£ > 2), and f{i) = i for all i £ [n]. Let 
A"! = [n]\{l,2,n} and = [n\\{l,n- l,n}. For 2 < i < 
n-1, let Xi = [n]\{i + 1}. Take H = nin, n, X, /). 

Then Q-^ is the complement of the (symmetric directed) odd 
cycle of length n. We have \Xi\ = 2£ — 2 for all i G [n]. 
Then (O becomes 

iV> 2 + log^(2£+l) . 

If q > 2i + 1 then we obtain > 3. Observe that in this case 
KqiT-L) = min-rankq(tJ-H) = 3 (see |8, Claim A.l]), and thus 
the bound is tight. 

VIII. Syndrome decoding 

Consider the {S,'H)-ECIC based on a matrix L. Suppose 
that the receiver Ri, i £ [m], receives the vector 

yi = xL + Ei , (9) 

where xL is the codeword transmitted by S, and is the 
error pattern affecting this codeword. 

In the classical coding theory, the transmitted vector c, the 
received vector y, and the error pattern e are related by y = 
c + e. For index coding, however, this is no longer the case. 
The following theorem shows that, in order to recover the 
message Xf(i) from using (|9|l, it is sufficient to find just 
one vector from a set of possible error patterns. This set is 
defined as follows: 

Ci{ei) ^{ei + z : z e s'psn{{Lj}j^y.)} . 

We henceforth refer to the set Ci{ei) as the set of relevant 
error patterns. 

Lemma 8.1: Assume that the receiver Ri receives y^. 

1) If Ri knows the message Xf(i) then it is able to determine 
the set Ci{ei). 

2) If Ri knows some vector e e Ci{ei) then it is able to 
determine Xf^iy 

We now describe a syndrome decoding algorithm for linear 
error-correcting index codes. We have 

y, - xx.Lx, - Ci e span ({!//(,)} U {Lj}j(.y^) . 

Let "^i = span({L^(i)} U {Lj}j^y.), and let Jf*-*-' be a parity 
check matrix of We obtain that 

H^^eJ ^H^^{y,-xx^Lx^f . 

Let be a column vector defined by 

(3,=H^\y,-xx^Lx^f . 

Observe that each Ri is capable of determining /3 j. This leads 
us to the formulation of the decoding procedure for Ri in 
Figure [T] 

Theorem 8.2: Let y^ = xL + be the vector received 
by Ri, and let wt(ei) ^ 5. Assume that the procedure in 



• Input: yi, xx^, L. 

m Step 1: Compute the syndrome 

> Step 2: Find the lowest Hamming weight solution e of 
the system 

• Step 3: Given that xxi = xXi, solve the system for Xf(iy. 

yi = xL + e. 

• Output: Xfi^iy 



Fig. 1: Syndrome decoding procedure. 



Figure [T] is applied to {yi,xxi, L). Then, its output satisfies 

Remark 8.1: It is not impossible that e 7^ e^. However, 
if wt(ei) < S, it can be shown that e G Ci{€i). Hence, by 
Lemma ISTl we have Xf^i-f — Xf^iy 
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